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ENGINEERING RFDOX PROTFIMR 

This invention relates to engineering proteins so that they have the required 
properties to carry out a particular function.. 
5 A very attractive goal for chemistry and biology is the creation of 

engineered protein structures capable of the same specific activities as their 
natural counterpart, but without the complexity associated with cellular functions. 
This has been addressed either by the de novo synthesis of peptides using 
spedfied templates, or by mutating wild type redox proteins or more recently by 
1 0 modular assembly of existing proteins. 

Protein engineering work has been carried out on many proteins including 
the E,cofi repressor of primer (rop) a regulatory protein which binds nucleic acid 
and has a 4-a-helix bundle motif. 

The 4-a-helix bundle is a commonly occurring, stable, structural motif. The 
4-a-helix bundle is typified by an association of 4-a-helices that cross at an angle 
of about 20». The helices may be linked by loops which can vary in their length, 
sequence and connectivity. The basic 4-heIix bundle framework can support a' 
diverse variety of activities, from oxygen transport in myohemerythrin to receptor 
binding in haematopoietic growth factors and RNA binding in rop. 

Wild type rop is dimeric. each monomer being formed by two helices 
connected by a sharp bend. Predki and Regan (1995) describe producing a single 
chain rop. Their method separates helix 1 and 2 sequences of one rop molecule 
and places helix 1 before helix V and places helix 2 after helix 2' of a second rop 
(consisting of helices r and 2'). The monomeric rop contains 4 a-helices In the 
order 1-r-2'-2 and all a helices are expressed as a single polypeptide chain. This 
method of ordering the 4 a helices maintains the wild type N- and C-termini and the 
connection between helix V and 2'. 

This invention provides a protein comprising: 

(a) a 4-a-helix bundle motif formed from the a-helices of rop (repressor of 

3 0 primer) and 

(b) a redox centre. 

The inventors have provided a stable protein capable of carrying out redox 
activities and other properties of the protein can be controlled. 
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This invention tal<es an existing simple and stable protein fold as a structure 
in whicli to insert new functions.. This approach mimics the natural process of 
protein evolution, where within a protein sujDerfamily with a particular stnjcture a 
diversity of function can be found. The protein of this invention is based on the 
5 well-studied and understood 4-helix-bundle protein fold. 

The protein of the current invention comprises a redox centre. A redox 
centre Is an atom, group of atoms or moiety capable of accepting and donating 
electrons. The redox centre may be a metal atom such as iron or copper which is 
stable in two different oxidation states. The redox centre may be an Iron sulphur 
10 centre which generally comprise two or four iron atoms bound to an equal number 
of sulphur atoms and to cysteine side chains. The redox centre may also be a 
haem group which comprises an iron atom held in the middle of a porphyrin 
structure. Alternatively the redox centre may not contain a metal and may be an 
organic redox centre such as FMN or FAD, 
15 The redox centre may be coordinated to the protein structure formed from 

the a-helices of rop. This may be the case when the redox centre is metal such, as 
an iron or copper atom. Other redox centres such as haem are associated to the 
a-helices of the protein scaffold. Residues such as hlstidlne, leucine, methionine 
and cysteine of the a-helices in the protein structure can be used to co-oidinate the 
10 iron atom of the haem as axial ligands when the redox centre Is haem. 

Residues suitable to coordinate to the redox centre are introduced into the 
helices of rop, and any residues present in the wild type sequence that could cause 
bonding to occur at an Inappropriate place are removed, by mutation of the 
polypeptide sequence. 

Simple natural cytochromes have reported mid-point potentials covering the 
range of -475 to +310 mV. The redox midpoint potential of the protein of this 
invention lies in the range -485 to +320 mV. The midpoint potential can be altered 
depending on the use of the redox protein. A major determinant of the redox 
potential of haem proteins is thought to be haem exposure. In addition other 
factors have also been implicated including the polarity of the haem environment, 
the strength of the axial ligand field, and the electrostatic Interactions between 
haem and its propionates. (Stellwagen (1978; Kassner, R.J. (1973); Valentino J.S. 
et al (1 979) & Moore. G.R. (1983)). It has also been demonstrated ttiat tiie redox 
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potential of cytochrome b^ea can be varied by over lOOmV by changing the amino 
acids in the vicinity of the haem, (Springs. S.L. et a/ (2000)). 

A second aspect of this invention provides a method of producing the 
protein of this Invention. Random mutagenesis or site directed mutagenesis is 
performed to engineer the protein such that it contains the necessary residues to 
enable redox centre binding to the protein. Therefore the DNA sequence which 
encodes the protein of this invention, either isolated or incorporated Into a vector is 
also part of the invention. This sequence is then expressed in and the polypeptide 
purified from a microorganism. (Alternatively it is possible that the polypeptide can 
be produced using a solid phase peptide synthesis). The resulting polypeptide 
Which is capable of binding a redox centre, is then Incubated with an excess of that 
redox centre to ensure Incorporation into the protein. 

A third aspect of this invention involves the use of the engineered protein 
The protein of the current invention may be used as an electron earner either on Its 
own or in a series of electron carriers. A series of electron carriers may be an 
electron transport chain, or it may include an electrode. An electron transfer chain 
transports electrons from a higher to a lower energy level along a series of electron 
camer molecules. An electron earner molecule is a molecule that tmnsfere an 
electron from a donor molecule to an acceptor molecule. An electron acceptor is a 
molecule that takes up electrons easily, thereby gaining an electron and becoming 
reduced, whereas an electron donor is a molecule that easily gives up an electron 
becoming oxidised in the process. Therefore the protein of this Invention can be ' 
used ,n a method which involves passing electrons along a sequence of electron 
earners, in which each electron earner Is reduced and then oxidised (or vice versa) 
by electron movement and the sequence of electron camera Includes ttie protein 
Of this invention. The electron transport chain may comprise natural or synthetic 
electron earners. The protein of this invention can be used In a mettiod Involving 
passing electrons along a sequence of electron earners. In which each election 
camer is reduced and then oxidised or vice versa by election movement and the 
protein of this invention fomis part of the sequence of electron carriers. In such a 
method electrons are generally moved along a gradient of electron earners with 
successively lower or higher redox potential. 

An alternative use of ttie protein of this Invention is as part of an apparatus 
comprising the protein associated with an electrode In a manner tiiat electrons may 
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be passed from one to- the next. The protein may be bound or adsorbed onto the 
electrode. This includes use of the protein of this invention in cyclic voltammetry. 
which is used to provide information about the redox protein's mid-point potential. 

As has been mentioned above the 4-a-helix bundle of rop protein fomis a 
stable structure. The redox protein of the present invention is also stable and has 
an unfolding free energy when denaturant is added to the protein of AG^^''2° 
wherein y^3.0 kcal/mol. The redox protein of this invention contains mutations 
which allow bonding of the redox centre. In addition to incorporation of a residue 
or residues for bonding to the redox centre a aromatic amino acids such as . 
tryptophan can be inserted into the protein. Insertion into the protein stnjcture of 
aromatic amino acids provides a method of spectrophotometric detection of the 
protein. It is necessary to ensure that the insertion of residues allowing 
spectrophotpmetric detection does not destabilise the protein. 

15 Brief description of the figures 

Figure 1 shows ribbon diagram representations of rop. The 3D models for 

monomeric rop were derived from the wild type dimeric rop model using the 

BIOPOLYMER and DISCOVERS modules within Insight //. 

Figure 2 shows the absorbance spectra of a bis-Histidine haem binding rop. The 
2 0 data were collected on a Hewlett-Packard 8453 UV-vislble photodiode an^y 

spectrophotometer. 

Figure 3 shows the electrochemical properties of the rop haem protein. 
Figure 4 shows the DNA and protein sequence of wild type dimferic rop with the a- 
helix regions shown in boxes. Also shown are psp7 amplification upstream 
2 5 sequence and asp4 amplification downstream sequence. 

Figure 5 shows the DNA and protein sequence of monomeric rop (S5S) (created by 
Predki & Regan) with the 

a-helix regions highlighted. Also shown are psp 7 amplification upstream 
sequence and asp4 amplification downstream sequence. 
Figure 6 shows a protein sequence of a redox proteln of the present invention 
fonmed from the a-helices of rop. produced according to example 1. 

In the preferred type of protein of this invention the 4-a-helix bundle 
structure used to develop the proteins of this invention is monomeric rop as 
described above. This proteih is developed from the wild type dimeric rop vMch is 
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Shown in Figure 4 and sequence ID Nos 5 and 6. In Figure 4 the portions of the 
sequences enclosed In boxes are the a-helices. In the momomeric protein all 4 
helices are expressed as a single polypeptide chain. Monomeric rop is shown in 
Figure 6 and sequence ID Nos 7 and 8. In Figure 5 the portions of the sequence 
enclosed in boxes are the a-helices. When specific residues are referred to the 
numbering system relates to the monomeric rop developed previously and shown 
in Figure 5. A structure for monomeric rop was generated by homology modeling 
using the program Insight II (MSI. San Diego) using the NMR structure as a 
template. This is shown in Figure 1 which shows the design of monomeric rop In 
this embodiment native dimeric rop (left) was engineered into a single polypeptide 
Cham monomeric variant (right) facilitated by the introduction of two 5 x Gly loops 
(Predki & Regan. 1995). The native His residues (76. 78. 107. 109) are shown In 
stick representation. In the protein of the present invention these His residues are 
removed as described below and residues necessary for interaction with the redox 
centre are introduced into the polypeptide. Removed means a particular amino 
acid IS no longer at that position. Preferably this is achieved via substitution 
however deletion may be a possibility. Introduced means a particular amino acid Is 
now found at that position. Preferably this is achieved via subtltution. however 
insertion may also be a possibility. Monomeric rop provides a better template for 
engineering than its dimeric precursor because of the ability to introduce single 
. mutations. 

As can be seen from Figure 5 the first two helices and the second two 
helices Of monomeric rep have the same sequence. The helix secondary structure 
may start one residue eariier in helix 4 (with A) than helix 3 because the GGGGG 
loop preceding it is long enough to allow reorientation, whilst the a-hellx secondary 
structure in helix 3 prebably begins at D as only one G is present between helices 
2 and 3, therefore A may still be part of the loop region. 

In the mutated monomeric rep proteins of this Invention the a-helix regions 
are generally conserved and preferably the helices of the 4-a-helix bundle of the 
protein of this invention show 6O0/0 or 70%. preferably 80 similarity or Identity with 
the a-helix regions indicated in Figures 4 and 5 and shown in sequence ID No. 1. 
2. 3 and 4. The. variation is generally introduced by removing native His residues 
76. 78. 107 and 109 as they could lead to inappropriate binding of a redox centre 
or by including residues required to incorporate the redox centre into the protein or 
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by altering residues in the vicinity of the redox centre. More variation may be 
introduced by residues which allow detection of the protein, for example Including 
aromatic amino acids such as tryptophan. The loops between each a-helix may, 
however, vary significantly (from that shown in Figure 5 and sequence ID Nos 7 
5 and 8) in terms of their length, sequence and connectivity, provided that the loop 
regions allow the a-helices to come together in 3D space to form the 4-a-helix 
bundle. 

The terms as used above, "similarity" and "identity" are l<nown in the art. 
The use of the temi "identity" refers to a sequence comparison based on identical 
1 0 matches between con-espondingly indentical positions in the sequences being 
compared. The tenn "similarity" refers to a comparison between amino acid 
sequences, and takes into account not only identical amino acids in corresponding 
positions, but also functionally similar amino acids in corresponding positioris. 
Thus similarity between polypeptide sequences indicates functional similarity, in 

15 addition to sequence similarity. 

Levels of identity between gene sequences and levels of Identity or 
similarity between amino acid sequences can be calculated using known methods. 
In relation to the present invention, publicly available computer based methods for' 
detemriining identity and similarity include the BLASTP. BLASTN and FASTA 

2 0 (Atschul et al.,J. Molec. Biol. . 1 990; 21 5:403-41 0), the BLASTX program available 
from NCBI, and the Gap program from Genetics Computer Group. Madison Wl. 

A more preferred protein of this invention is based on monomeric rop as 
described above and contains the following mutations, the histidines at H76 H78 
H107 and HI 09 are substituted for other residues. In one embodiment H76A and 

25 H 1 07A substitutions create a binding site for a haem group. In addition the 

mutations H78W and H109W may occur to create a means of spectrophotometric 
detection of the novel redox protein. Further mutations may also be incorporated 
into this protein for example L56H and L1 13H. The sequence of the especially 
preferred redox protein with mutations H76A, H78W. H107A, H109Wand L56H 

30 and L1 13H is given in sequence ID No. 1 1 and in Figur© 6. 

All the proteins of this invention are stable and have an urifolding free 
energy as described above. The stability of the proteins of this invention is 
comparable with or greater than the stability of wild type rop Itself. 
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Example 1 Production of monomeric rop nn^ s essina a ha^m g m. .p 
The monomeric rop constructs. (855). was the starting point for the rational 
design strategy (Munson et al 1994; PF Predki & Regan, 1995). Site directed 
mutagenesis was perfomied either by the "Megaprimer method" or using a method 
5 based on the "Quik Change-method (Stratagene Europe) (Trade mari<)in which a 
plasmid IS amplified using two complementary oligonucleotides containing the 
desired mutation. DNA manipulations were carried out in E.cofi DH5 a or £-co// 
XU-Blue (stratagene Europe). When the "Megaprimer method" was used the rop 
gene was amplified by PGR using flanking primers psp7 and asp4 (sequence ID 
10 No 9 and 10 respectively) and mutations were confirmed by sequencing of the 

PGR product. Oligonucleotides were obtained from Amersham Phamiada Biotech 
(Table 1). 



Table 1. OUgonuleotides used to enginee r the haem binding mutants. 

Oligonucleotide Forward QHc ,onucleotide Rever.. "— 

PSP/ gcggaaattaatacgactra — 

asp4 

H76A,H78A gaatcgcttgccgacgccgctgatg 8<^c88t88cagcagccaac 
H76A^78W gaatcgctlgctgactgggctgat 
20 H107A,H1.09W 

L56H ctcagacattaacgcacctagagaagcttaacg gagctcgttaagcttctctaggtgcgttaatgtctgag 

agctc 

Le^M ^g^g^atgagcattaccgcagctgcc ggcagclgcggtaalgctcatcagcggcg 

F12m . f ^<=«^cgagatgggggcgga^cagg cctgttcatccgcccccatcSStte 
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The initial step was to remove potential non-specific llgands by the 
replacement of endogenous histidine residues creating the mutant H76A H78W 
H107A. H109W. see Figure 1A. A haem binding site was created in this protein by 
the further mutations L56H. L113H. see Figure IB. Figure 1B shows the design of 
haem binding sites in monomeric rop. A monomeric rop variant (left) lacking the 
r^tive His residues (H76A. H78W. H107A. H109W) was produced by site directed 
mutagenesis. Mutations L56H. L113H were Incorporated to Introduce haem 
ligands. A sequential approach was taken to Incorporate multiple mutations into 
rop. Constructs leading to the creation of a haem-blnding rop variant are listed in 
Table 2 below. The RDM14.5 constmct. containing the (L56H. L113H) haem 
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binding site (rop.56H/113H) was made by.the subcloning of the appropriate 
fragments from RDIVI10-b4 and QC1 1-J1. 
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Table 2. Rop constmcts produced for this study with mutations from the S55 

sequraice. 



Construct Mutatio ns 

S55 



10 JW2 



JWl H107A, H109A 

JW3sub H76A, H78A, H107A, H109A 

JW2 H76A, H78W, H107A, H109W 

JW7 H76A,H78W,H107A,H109A 

RDM2.1 H76A, H78W. H107A, H109W, L63H 

QC3.5 H76A. H78W, H107A, H109W, L63R F121H 

QC7^ H76A, H78W. H107A, H109W, L63M, F121H 

^^^^^"^ H76A,H78W,H107A,H109W.L56H;G64R 

QCl 1-Jl H76A, H78W, H107A, H109W, LI 13H 

j H76A, H78W. H107A. H109W. L56a L113H 



The haem binding variant was heterologously expressed from the plVIR103 
Plasmid in E.co// BL21(ADE3) and purified to homogeneity by standard techniques 
Haem was incorporated into the apo-protein by addition of a 4.fold molar excess of 
hemin chloride solubilised in DMF. incubation overnight, and removal of free haem 
by iron exchange chromatography (DEAE sepharose FF. Amersham Phamiacia 
Biotech UK Ltd). The resulting haem binding rop protein has spectral properties 
characteristic of native b-type cytochromes; Figure 2 shows spectra of bis- 
Histidine haem bind rop variant in oxidized and reduced form. Solid line: 20mM 
haem binding monomeric rop variant LSSH. L113H in 10mM sodium phosphate 
buffer. PH80. Dashed line: 20mM rop following reduction by sodium dithionite The 
oxidized protein has a at 413 nm. which upon reduction with sodium dithionite 
shifts to 426 nm with distinct a and p peaks at 559 nm and 531 nm respectively 
The spectral characteristics are typical of b-type cytochromes with a six coordinate 
low spin iron, and this is a strong indication that the haem group has been 
incorporated Into the helix bundle as predicted in the design. 

Example 2. Testing the redox p m perties of thic: n p yei haem prnte.n 
The redox properties of tills novel haem protein rop-56H/li3H were 
detemnined by spectroelectrochemistry and cyclic voltammetry and are presented 
in Figure 3. Figure 3A shows redox titration of rop variant with bound haem. 
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Titration of sodium dithionite into protein solution (40pM) was performed 
anaerobically. Following equilibration the UV-visible spectrum was obtained 
(Hewlett-Pacl^ard 8453 spectrophotometer) and the potential measured 
simultaneously (Autolab PGSTAT10 potentiostat). Titrations were performed at 
5 20 ""C in pH 7.8 buffer. 

Figure 3B shows cyclic voltammograms of rop variant with bound haem. 
Cyclic voltammetry was performed on a bare glassy carbon.electrode over a range 
of scan rates. 

Spectrophotometic titrations were performed in an anaerobic a quartz 
10 cuvette (Hellma (England) Limited UK) equipped with a platinum-mesh working and 
an Ag/AgCI reference electrode, according to the procedure described in Dutton. 
P.L.. 1978. Ambient redox potentials were adjusted by addition of aliquots (<5m) 
of sodium dithionite or potassium ferricyanide. Titrations were performed in SOmlVI 
potassium phoshate. pH 7.5. Electrode-solution mediation was f^a'litated by the 
1 5 following mediators: phenazine methosulphate (5 pM). duraquinone (5 2- 

hydroxy 1.4 napthoquinone (5 pM) benzyl viologen (2 pM). indigocarmine (0.5 pM). 
resonjfln (0.5 pM). The modified cuvette was flushed with oxygen^free nitrogen 
before and during the measurements. After each addition of sodium dithionite (0.5 
Ml of a 10 mM solution) the protein was allowed to equilibrate and at each potential, 
2 0 the optical spectrum was recorded using a Hewlett-Packard 8452 diode array 
spectrophotometer. The fraction of the reduced protein was estimated at each 
potential by following the shift In Soret peak (413 to 426 nm) and was plotted 
against potential. The data points were fitted to the Nemst equation with n^l: 

E =Em +(RT/nF) In ([ox]/[red]) 

2 5 where [ox] and [red] are the concentration of oxidised and reduced haem 

respectively, E is the solution redox potential at equilibrium, n Is ttie number of 
electrons. Em is the mid-point potential. F is the Faraday constant. R is the gas 
constant and T Is the absolute temperature. 

The redox titrations with sodium dithionite (Figure 3A) show one larger 

3 0 component (78%) at -154 ± 2 mV and one smaller (22%) at 17 ± 9 mV. relative to 

the nomrial hydrogen electrode. The presence of two reduction potentials is likely to 
be due two non-equivalent orientations in the engineered rop. This is at present 
being investigated. Only one mid point potential of Em- -134 ±13 mV could be 
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measured using cyclic voltammetry on a bare glassy-carbon electrode using a 
modified Hagen cell (Figure 3B). 

These reduction potentials are consistent with those measured for natural 
cytochromes and heme-containing synthetic peptides, that cover the range of -475 
5 to +31 0 mV. The properties of the rop heme binding protein are at present being 
optimised with the creation of a cavity that accommodates the heme macrocycle 
with a unique orientation, with the possibility of tuning the redox potential. It has 
been demonstrated that the redox potential of cytochrome b562 can be varied by 
over 100 mV by changing the amino acids in the vicinity of the heme and Dutton 
10. and coworkers demonstrated that by utilizing alternative porphyrins it was possible 
to modulate the redox potential of their designed heme protein by as much as 225 
mV. It is anticipated that the redox properties of this designed prx)tein could 
likewise be "tuned" as required for particular applications. 

15 Example 3: Stability of m p 

The protein of this invention shows a high degree of stability which may be 
measured as the unfolding free energy when a denaturant Is added to the protein, 
of AGoi^ "2° is greater than or equal to y wherein yiS.O kcal/mol. 
The stability test refers to the chemical stability. This was measured In solution by 

20 detecting changes of the circular dichroism signal at 222 nm and changes of 
fluorescence anisotropy signal of the protein upon additions of guanidlnlum 
hydrochloride. Experiments were earned out by addition to the protein of 
denaturant (guanidlnlum hydrochloride) to reach final concentrations from zero to 
6.5 M. The parameter that Is related to the stability Is the AG of unfolding in water 

25 (A Gob, "2°). . 

The data derived from the CD and fluorescence anisotropy were analysed 
assuming a two state model, fp + f^ = 1. where fp and f^ represent the fraction of 
total protein in the folded and unfolded state respectively. The equilibrium constant 
Ku and the free-energy change AiGo^," for the unfolding reaction were calculated 

3 0 from: 

Ku = fu/(1-fu) = fu/fp 
Gobs" = - RT In Ku 

According to the linear extrapolation method, the unfolding free energy depend 
lineariy from the denaturant concentration (Shellman, J.A., 1987): 



wo 2004/050693 



PCT/GB2003/005256 



11 



u - 



Gobs^ " Gobs 2 

The results are shown in table 3. 



H O 



m * [GuHCI] 



Table 3: Thermodynamic parameters characterizing the unfolding process 



Sample 


DG1 
(kcal/mol) 


(kcal/mol M) 


DG2 
(kcal/mol) 


m2 

(kcal/mol M) 


S55 


8.7±0.5 


2.4±0.3 






JW2 


8.7±0.8 


1.9±0.5 






QC7.2 


3.1 ±0.5 


2.9±0.2 


3.9±0.5 


1.1±0.3 


RDM14.5 


3.5±0.6 


3.0±0.9 


4.0±0.5 


1.0±.0.2 



10 



15 



20 



The mutant Jw2, the parent from which all of the mutants containing the 
haem have been obtained, shows a AG of unfolding of 8.7 kcal/mol. as also 
shown for the original monomeric rop S55, 8.7 kcal/mol. 

A two-step transition model was necessary to adequately fit the data of 
RDM14.5 and QC7.2, resulting In a total free energy change of 7.5±1.1 kcal/mol 
and 7.0±1.0 kcal/mol respectively (see table 3). The unfolding of the starting 
constnjct proteins S55 and JW2 have been also reported for comparison in table. 
In these cases the unfolding process was satisfactorily represented by a simple 
two-state transition with similar free energy changes (table 3). These findings 
suggested that the haem bound proteins undergo a different unfolding mechanism 
due to the presence of the haem group. This hypothesis has been tested 
measuring the changes of QC7.2 and RDM14.5 absorption spectrum In the visible 
region, as a function of guanidinium hydrochloride (GdHCI) concentration. Most of 
the signal at 415 nm is already lost at 3 M GdHCI. confinning the loosening of the 
haem group. 
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1. A protein comprising: 

a) 4-a-hellx bundle motif fomied from the a-helices of ROP (repressor of 
primer) and 

b) a redox centre. 



2 

10 haem 



The protein of Claim 1 wherein the redox centre is a metal, preferably iron 
or copper, is an iron-sulphur centre, haem, FMN or FAD and is preferably 



3. The protein of Claim 1 or 2 wherein the redox centre is bound to the protein, 
preferably coordinated by one or more of histidine, leucine, methionine or 
cysteine residues and more preferably by 2 histidine residues. 

4. The protein of Claim 1 or 2 wherein the redox centre is covalently bound to 
the protein. 

5. The protein of Claim 1 which has a redox mid-point potential In the range 
20 -485 to +320mV. 

6. The protein of Claim 1 which has a-helix regions each having 60%, 
preferably 70% and more preferably 80% similarity or identity with the a- 
hellx regions of sequence ID Nos. 1 and 3. 

25 

7. The protein of daim 6. wherein said four a-helix regions are connected by 
loops. 

8. The protein of claim 7, wherein the four a-hellces are joined in the order 1- 
30 1'.2-2. 

9. The protein of Claim 1 which is formed by connecting two wild type ROP 
proteins to obtain the 4-helix bundle as one continuous polypeptide having 
60%, preferably 70% and more preferably 80% similarity or identity with 

35 sequence ID No. 8. 



10. 



The protein of claim 9, wherein the histidine residues con-esponding to H76, 
H78, H107 and H109 in sequence ID No. 8 are removed. 



wo 2004/050693 ^ PCT/GB2003/005256 

14 

11 . The protein of claim 9 or 1 0, wlierein histidlne, leucine, methionine or 
cysteine residues are introduced one or both positions conresponding to 56 
and 1 13 in sequence ID No. 8, preferably histidines are introduced at both 
positions 56 and 1 13. 

5 . . 

12. The protein of any preceeding claim which has a haem redox centre co- 
ardinated to the 4-a-hellx bundle motif via two histidlne residues. 

1 3. The protein of claim 12 which has a rnld-point potential in the range -400mV 
10 to+300MV. 

14. The protein of claim 12 which has ttje sequence as indicated by sequerlce 
ID No. 11. 

15 15. The protein of any one of claims 1 to 14 which has a stability, measured as 
the unfolding free energy when denaturant is added to the protein, of 
AGote"2°2 wherein yk3.0 kcal/moL 

1 6. A method of producing tiie protein of any one of claims 1 to 1 5 comprising 
20 1) expressing all four a-helices as a single polypeptide chain; 

ii) engineering tiie required mutations to enable redox centre binding; 

iii) expressing and purifying.or producing ttie redox centre binding 
mutant; 

iv) Incubating tfie protein with an excess of the redox centre. 

25 

17. A nucleotide sequence which encodes the protein of any one of claims 1 to 
15 or a fragment thereof. 

18. A vector comprising the nucleotide sequence of daim 1 7, 

30 

1 9. Use of the protein of any of claims 1 to 1 5 In a sequence of electron camera 
and preferably an election ta^nsfer chair). 

20. A metfiod of passing electrons along a sequence of electron earners, in 
3 5 which, each electron carrier is reduced and tiien oxidised or vice versa by 

elertron movement and the sequence of electron carriers includes the 
protein of any one of claims 1 to 15. 
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21 . An apparatus comprising the protein of any one of claims 1 to 1 5 
associated with an electrode. 



22. 

5 



An apparatus according to claim 21 wherein the protein is adsorbed onto 
electrode. 
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Figure 3A 
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Figure 3B 
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Sequence of dimeric rop 



(psp7 aniplification i^stzeam sequence) 

^^^^^^^^SB? TATAGGG6AA ITSTOAGCGG ATAACAATTC 
CCCTCTA6CT A6AAATAATT TTGTTXAACT TTAAGAAGGA GATATACC 



1 

M 

ATG 


G 

GGT 


T 

ACC 


K 

AAA 


Q B K T 

CAA GAA AAA ACC 


10 
A L 

GCC CTT 


K 

AAC 


K A 

ATG GCC 


R 

CGC 


20 

F I R S Q T 
TTT ATC AGA AGC CAG ACA 
























■21 
L 

TTA 


T 

ACQ 


Z. 

CTT 


Ii 

CTG 


E K L N 

GAG AAA CTC AAC 


30 
B Z. 
GAG CTG 


D 

GAC 


A 

GCG 


D 

GAT 


B 

GAA 


40 

.0 A D Z C B 

CAG GCA GAC ATC TGT GAA 
























41 
S 

TCG 


Zi 

CTT 


H 

CAC 


D 

GAC 


B A D B 

CAC GCT GAT GAG 


50 

Z. Y R 

CTT TAG CGC 


S 

AGC 


C 

TGC 


Zi 

CTT 


€0 

A R F G n D 

GCC CGT TTC GGC GAC GAC 



61 

6 B N Zi - stop 
GGT GAA AAC CTG TAG 

GGATCCGGCT GCTAACAAA6 CCCGAAAGGA AGCTO AGTTG GCTGCTGCCA CCGCTGAGC 

(asp4 amplificatioii downstieam sequence) 



Figure 4 
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Sequence of monomeric S5S rop 



(psp7 amplification upstream sequence) . . 

p^^ P^^l ^^p^ P^^P^ TAXAGGGGIkA TT6T6AGCG6 
CCCTCTAGCT AGAAATAATT TTGTTTAACT TTAAGAAGGA GATATACC 



X 

M 6 

ATG GGT 



' •• • : : 2^ 

TKQBKTALNMARP X RSQT 
ACC AAA CAA GM. AAA ACC GCC CTT AAC ATG GCC CGC TTT ATC AGA AGC CAG ACA 



2X 30 
Xi T.IiLSKLNEIf 
TTA ACG CTT CTG GAG AAA CTC AAC GAG CTG 



6 G G G Q 

GGT GGC GGT GGC GGT 



'■ w 

T R Q E K 

ACC AAA CAA GAG AAG 



41 SO . 55" 

TAZ-K MAKFX RS'QTLTI. X.BKI« 
ACC GCC CTT AAC ATG GCC CGC TTT ATC AGA TCT CAG ACA TTA ACG CTT CTA GAG AAG CTT 



■^1 

N E 
AAC GAG 



Ii 

CTC 



G A 

GGG GCG 



: 80 

DEQA. DZCESZiB DHAD 
GAT GAA CAG GCA GAC ATA TGT GAA TCG CTT CAC GAC CAC GOT GAT 



81 90 
EZ.YRSCLARF 

GAG CTT TAC CGC AGC TGC CTT GCC CGT TTC 



G G G G G 

GGT GGC GGT GGC GGT 



A D S Q A 

GGG GAT GAA CAG GCA 



101 110 £20"" 

DICES- H D H AD EIi Y R S C Xt A R 

GAC ATC TGT GAA TCG CTT CAC GAC CAC GCT GAT GAG CTT TAC CGC AGC TGC CTT GCC CGT 



T21 








F 


G D D 


G E K X. - 




TTC 


GGC GAC GAC 


GGT GAA AAC CTG 


TAG 



GSATCCGGCT GCTAACRAAG CCCX31AAW3GA ASCTG aGTTO GCTBCTGCCA CCGCTGAGC 
(asp4 ampHfication downstitani seqnenoe) 



Figures 
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10 ■ 20 30 40 50 

....I, ...I 

RDM14-5 MGTKQEKTAL NM?IRFIRSQT LTLLEECLNEL GGGGGTKQEK TALNMARFIR 



60 70 80 90 100 

BDM14-5 SQTLTHI.EKL NELGADEQAD XCESIADK2UD ELYRSCLARF GGGGGADEQA 



110 120 130 
I I I I ....I I . 

RDM14-5 DICESLADHA DEHYRSCLAR FGDDGENL^ 



Figure 6 
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SEQUENCE LISTING 

<110> Imperial College Innovations Ltd 

<120> Engineering Redox Proteins 

<130> HMJ03488WO 

<140> 
<141> 

<160> 11 

<170> Patent In Ver. 2.1 

<210> 1 
<211> 84 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
<222> (1) , . (84) 

<223> flelix 1 of E.coli repressor of primer (rop) 
<400> 1 

acc aaa caa gaa aaa acc gcc ctt aac atg gcc cgc ttt ate aga age 48 
Thr Lys Gin Glu Lys Thr Ala Leu Asn Met Ala Arg Phe He Arg Ser 
^ 5 10 



15 



cag aca tta acg ctt ctg gag aaa etc aac gag ctg 84 
Gin Thr Leu Thr Leu Leu Glu Lys Leu Asn Glu Leu 
20 25 



<210> 2 
<211> 28 
<212> PRT 

<213> Escherichia coli 
<400> 2 

Thr Lys Gin Glu Lys Thr Ala Leu 
1- 5 

Gin Thr Leu Thr Leu Leu Glu Lys 
20 



Asn Met Ala Arg Phe He Arg Ser 
10 15 

Leu Asn Glu Leu 
25 
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<210> 3 
<211> 84 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
<222> (1) . . (84) 
<223> Helix 2 of rop 

<400> 3 

gat gaa cag gca gac ate tgt gaa teg ctt cac gac cac get' gat gag 
Asp Glu Gin Ala Asp lie Cys Glu Ser Leu His Asp His Ala Asp Glu 
1 .5 ' 10 15 

ctt tac cgc age tgcett gcc cgt ttc ggc gac gac 
Leu Tyr Arg Ser Cys Leu Ala Arg Phe Gly Asp Asp 
20 . 25 



<210> 4 
<211> 28 
<212> PRT 

<213> Escherichia coli 
<400> 4 

Asp Glu Gin Ala Asp lie Cys Glu 
1 5 - 

Leu Tyr Arg Ser Cys Leu Ala Arg 
20 



Ser Leu His Asp His Ala Asp Glu 
10 15 

.Phe Gly Asp Asp 
25 



<210> 5 
<211> 192 
<212> DMA 

<213> Escherichia coli 

.<220> 
<221> CDS 
<222> (1) . . (192) 
<223> wild type dimeric rop 

<.400> 5 

atg ggt ace aaa caa gaa aaa acc 
Met Gly Thr Lys Gin Glu Lys Thr 



gcc ctt aac atg gcc cgc tttatc 48 
Ala Leu Asn Met Ala Arg Phe lie ' 
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15 



aga age cag aca tta acg ctt ctg- gag aaa etc aac gag ctg gac gcg 96 
Arg ser Gin Thr Leu Thr Leu Leu Glu Lys Leu Asn Glu Leu Asp Ala 

25 . 30 

gat gaa cag gca gac ate tgt gaa teg ctt eac gac cac get gat gag 144 
Asp Glu Gin Ala. Asp lie Cys Glu Ser Leu His Asp His Ala Asp Glu 
35 40 45 

ctt tac .cge age tgc ctt gee cgt ttc ggc gac gac ggt gaa aac ctg 192 
Leu Tyr Arg Ser Cys Leu Ala Arg Phe Gly Asp Asp Gly Glu Asn Leu 
• 55 60 



<210> 6 
<211> 64 
<212> PRT 

<213> Escherichia coli 
<400> 6 

Met Gly Thr Lys Gin Glu Lys Thr Ala Leu Asn Met Ala Arg Phe He 

15 



5 10 



Arg ser Gin .Thr Leu Thr Leu Leu Glu Lys Leu Asn Glu Leu Asp Ala 



20 



25 



30 



Asp Glu Gin Ala Asp He Cys Glu Ser Leu His Asp His Ala Asp 
35 40 45 

Leu Tyr Arg Ser Cys Leu Ala Arg Phe Gly Asp Asp Gly Glu Asn 

55 60 



Glu 



Leu 



<210> 7 
<2il> 384 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Monomeric 
containing all 4 helices in one continuous 
sequence 

<220> 

<221> CDS 

<222> (I) . . (384) 
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<223> Monomeric rop consisting of helices 1-1 '-2 '-2 and 
helices 1 and 1 • , and 2 • and 2 are connected by 
GGGGG loops 

<400> 7 

atg ggt acc aaa caa gaa aaa acc gcc ctt aac atg gcc'cgc ttt ate 

Met Gly Thr Lys Gin Glu Lys Thr Ala Leu Asn Met Ala Arg. Phe He 
^ . 5 .10 15 

aga age cag aca tta acg ctt ctg gag aaa etc aac gag ctg ggt ggc 
Arg Ser Gin Thr Leu Thr Leu Leu Glu Lys Leu Asn Glu Leu Gly Gly 
20 25 30 



gcg gat gaa cag gca gac ata tgt gaa teg ctt cac gae cac get gat 
Ala Asp Glu Gin Ala Asp He Cys Glu Ser Leu His Asp His Ala Asp 
" 70 75 30 

gag ctt tac cgc age tgc ctt gee cgt ttc ggt ggc ggt ggc ggt gcg 
Glu Leu Tyr Arg Ser Cys Leu Ala Arg Phe Gly Gly Gly Gly Gly Ala 
85 90 95 

gat gaa cag gca gae ate tgt gaa teg ctt cac gac cac get gat gag 
Asp Glu Gin Ala Asp He Cys Glu Ser Leu His Asp His Ala Asp Glu 
100 105 110 

e.tt tac cgc age tgc btt gcc cgt ttc ggc gac gac ggt gaa aac ctg 
Leu Tyr Arg Ser Cys Leu Ala Arg Phe Gly Asp Asp Gly Glu Asn Leu 
115 120 125 



48 



96 



ggt ggc ggt acc aaa caa gag aag acc gee ctt aad atg gcc cgc ttt 144 
Gly Gly Gly Thr Lys Gin Glu Lys Thr Ala Leu Asn Met Ala Arg Phe 
35 40 45 

ate aga tet cag aca tta acg ctt eta- gag aag ett aac gag etc ggg 192 
He Arg Ser Gin Thr Leu Thr Leu Leu Glu Lys Leu Asn Glu Leu Gly 
50 55 • 60 



240 



288 



336 



384 



<210> 8 
<211> 128 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Monomeric rop 
containing all 4 helices in .one continuous 
sequence 

<400> 8 

Met Gly Thr Lys Gin Glu Lys Thr Ala Leu Asn Met Ala Arg Phe He 
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15 10 15 

Arg Ser Gin Thr Leu Thr Leu Leu Glu Lys Leu Asn Glu Leu Gly Gly 
20 25 30 

• Gly Gly Gly Thr Lys Gin Glu Lys Thr Ala Leu Asn Met Ala . Arg Phe 
35 40 45 

lie Arg Ser Gin Thr Leu Thr Leu Leu Glu Lys Leu Asn Glu Leu Gly 
50 55 60 

Ala Asp Glu Gin Ala Asp lie Cys Glu Ser Leu His Asp His Ala Asp 
65 70 75 80 

Glu Leu Tyr Arg Ser Cys Leu Ala Arg Phe Gly Gly Gly Gly Gly Ala 
85 90 95 

Asp Glu Gin Ala Asp lie Cys Glu Ser Leu His Asp His Ala Asp Glu 
100 105 110 

Leu Tyr Arg Ser Cys Leu Ala Arg Phe Gly Asp Asp Gly Glu Asn Leu 
115 120 125 



<210> 9 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> . 

<223> Description of Artificial Sequence: psp7 upstream 
aiK^lification seG[uence 

<400> 9 

.gcgaaattaa tacgactca 



<210> 10 
<211>. 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: asp4 
downstream amplification sequence 

<400> 10 
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23 



WO 2004/050693 

gttggctgc.t gccaccgctg age 

<210> 11 
<211> 128 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: RDM14.5 
<40b> 11 

Met Gly Thr Lys Gin Glu Lys Thr Ala Leu Asn Met Ala Arg Phe lie 
^ .5 10 ■ 15 

Arg Ser bin Thr Leu Thr Leu Leu Glu Lys Leu Asn Glu Leu Gly Gly 
20 25 30 

Gly Gly Gly Thr Lys Gin Glu Lys Thr Ala Leu Asn Met Ala Arg Phe 
35 40 .• 45 ■ . 

He Arg Ser Gin Thr Leu Thr His Leu Glu Lys Leu Asn Glu Leu Gly 
50 55 60 

Ala Asp Glu Gin Ala Asp lie Cys Glu Ser Leu Ala Asp Trp Ala Asp 
" 70 75 80 

Glu Leu Tyr Arg Ser Cys Leu Ala Arg Phe Gly Gly Gly Gly Gly Ala 
85 90 gg . 

Asp Glu Gin Ala Asp He Cys Glu Ser Leu Ala Asp Trp Ala Asp Glu 
100 105 110 

His Tyr Arg Ser Cys Leu Ala Arg Phe Gly Asp Asp Gly Glu Asn Leu 
■ 115 120 125 
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